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Abstract. This work introduces a stochastic model for the spread 
of a virus in a cell population where the virus has two ways of 
spreading: either by allowing its host cell to live on and duplicate, 
or else by multiplying in large numbers within the host cell, causing 
the host cell to burst thereby letting the viruses enter new unin- 
fected cells. The model is a kind of interacting Markov branching 
process. We focus in particular the probability that the virus pop- 
ulation survives and how this depends on a certain parameter A 
which quantifies the 'aggressiveness' of the virus. 

Our main goal is to determine the optimal balance between ag- 
gressive growth and long-term success. Our analysis shows that 
the optimal strategy of the virus (in terms of survival) is obtained 
when the virus has no effect on the host cell's life-cycle, corre- 
sponding to A = 0. This is in agreement with experimental data 
about real viruses. 
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1. Introduction 

A virus is a simple parasitic organism consisting of compacted genetic 
material in a protein or lipid vessel. Viruses prey on living cells, such as 
bacterial or human cells, by penetrating the membrane of the cell and 
transferring their genetic material into the host. In order to multiply, 
the virus has two basic possibilities. The first option is for the virus to 
temporarily incorporate its genetic material in the host genome, and 
thereby be passively replicated along with the latter. The other option 
is to seize the host's replication machinery and aggressively replicate, 
thereafter releasing its progeny in the surrounding medium. The 'free 
virions' must then attach to new host cells within a short time in order 
to survive. For many viruses, this process necessarily involves bursting 
the host cell, thereby killing it. 
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The technical term for the event that a virus bursts its host cell is 
lysis, and one says that the virus lyses the host cell. A virus which is in- 
corporated into, and passivlely replicated along with, the host genome 
is said to be in the lysogenic state, or to employ the lysogenic strategy. 
Sometimes one speaks loosely of 'the lytic strategy' to denote that a 
virus 'becomes lytic', that is to say actively lyses the host cell. A lyso- 
genic virus will eventuelly become lytic; in fact, it is well-established 
experimentally [13] that viruses in the lysogenic state will revert to the 
lytic strategy if the host cell is under stress and in danger of dying, 
enabling the virus to find a 'safer' host. 

We introduce a stochastic model to investigate this behaviour. The 
model is a two-dimensional Markov process {X(t), Y{t))t>o, where X{t) 
is the number of 'healthy cells' at time t, and Y{t) is the number 
of 'infected cells' (i.e. cells having virus in them). Both components 
{X(t))t>Q and (Y(t))t>o behave in many ways like branching processes, 
although there are dependencies between them. A healthy cell is re- 
placed by a random number of new healthy cells at rate 1. This random 
number is independent of other events and drawn from a distribution 
{Pk)k>o- Infected cells behave similarly, although they are replaced by k 
new cells at rate pkiik >1, while they are replaced by new cells (die) 
at the higher rate po + A. Here A > is a parameter that reflects the 
negative impact of the virus on the hosts lifelength. When an infected 
cell dies (i.e. is replaced by new cells), it bursts (lyses) and releases 
'free virions'. These free virions immediately enter a random number 
of healthy cells, thus converting them into infected cells. The number 
of new infections is independent of all other events, and is drawn from 
a distribution (7fe)fe>o- The model is defined in detail in Section 2.1. 

We are concerned with a fundamental question about the virus' re- 
productive strategy, namely: what is the optimal level of 'aggressive- 
ness' (balance of lysis to lysogeny) from the point of view of the virus? 
Here we interpret 'optimality' as maximizing the chance of the virus 
establishing itself in the cell population and, ultimatly, surviving in 
the long-term. We therefore study the extinction probability r) of the 
infected process {Y{t))t>o (see Definition 2.1). We are interested in rj, 
or rather 1 — as an indicator of the 'fitness' of the virus, and are 
mainly concerned with how it depends on A. This is because A governs 
the relative rate of lysis events, and is thus a measure of the 'level of 
aggressiveness' of the virus. 

For the experimentally well-studied virus Lambda, the lysogenic 
state appears overwhelmingly stable. Once in the lysogenic (dormant) 
state, it has been found very unlikely to spontaneously switch to the 
lytic state [4, 12]: a spontaneous transition to the lytic state occurs 
about once in 10^ generations [4]. This is lower than the mutation rate 
of the incorporated viral genome, which is once in 10^ to 10^ genera- 
tions [12]. It is natural to ask if this lysogenic stability is an advantage 
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for the success of the virus infection? For the virus Lambda, a choice 
between lytic and lysogenic also occurs at the moment of infection. We 
focus mainly on the virus' decision after it has been incorporated in 
the host genome, but in Section 5.3 briefly deal also with the decision 
at the moment of infection. 

Our model is of course a simplification of real virus populations. 
Indeed, we make the following basic simplifications: life-lengths of cells 
are assumed to be independent and exponentially distributed, spatial 
separation and locations of cells arc not taken into account, and the 
number of new infections caused by one infected cell is assumed not 
to vary with the population sizes {X(t))t>o and {Y{t))t>o (except if 
all remaining healthy cells are to become infected). The advantage of 
making such simplifications is that a detailed and rigorous analysis can 
be performed, hopefully highlighting general principles that can then 
form the basis for more realistic modelling. 

It is well-known [3, 6] that a branching process either dies out, or 
grows exponentially fast for all time. Thus a branching process is not 
a realistic long-term model for population size, in light of the limited 
resources in the real world. Instead, we see the surivival probability 1 — 
77 as an indicator of the probability that a virus population establishes 
itself in a population of healthy cells in the first place. In this sense our 
model is primarily relevant for the early stages of a virus infection and 
the competition between two growing populations. Since our model 
is concerned with qualitative properties of reproductive strategies, not 
with numerical estimates of population size, we do not see the use of 
branching processes as a limitation. In what follows we will refer to 
1 — as the 'survival probability' as this is the appropriate term in 
the context of branching process theory, bearing in mind that when 
interpreting our results in terms of real viruses one should rather think 
of 1 — 77 as an indicator of the relative success in establishment and 
proliferation. 

Recall the concept of stochastic ordering of probability vectors: if 
— (7rfe)fe>o and tt' = {T^k)k>o are probability vectors then we say that 
tt' is stochastically larger than tt if 



for all k > 0. We denote this by tt ^ tt'. The following is the main 
result of this paper. 

Theorem 1.1. For A > 0, 70 = and any starting conditions X{0) > 
1,^(0) > 1, we have thatrj — rj{X, {'jk)k>o) is monotonically increasing 
in A and {jk)k>o- 

Thus, loosely speaking, the virus maximizes its survival probability 
by being as passive as possible (i.e. when A = 0). This is in agreement 




j>k 



j>k 
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with the observed stabihty of the lysogenic state for real viruses, see 
Section 5. However, the full details of how the 'fitness' 1 — r] depends 
on A are complex, and depend on the other parameters of the process. 
Furthermore, simulations and heuristic arguments suggest that mono- 
tonicity in A may hold under weaker assumptions (Example 3.4 and 
Section 5.2) than in Theorem 1.1, but interestingly r] is not monotone 
in A for all choices of the other parameters (Proposition 3.3). We have 
not been able to find a counterexample to optimaly of A = 0. 

Thus this paper highlights the principle that in order to achieve long- 
term survival it may be better to 'be kind' to your host environment, 
even if this hampers your short-term expansion. We do not aim to give 
a complete and final picture, however, and there are many interesting 
questions and research directions that fall outside the scope of the cur- 
rent paper. For instance, in [5], the rigorous mathematical treatment 
of the model will be continued as will be explained in more detailed in 
later sections. Other possible directions include studying real life data, 
and in the cases when a rigorous mathematical treatment is unfeasible, 
use simulation studies to connect the proposed model to these data. 

Most previous models studying the spread of viruses are non-stochastic 
and formulated in terms of differential equations. A notable example 
is that of [15]. We prefer to formulate our model in microscopic terms, 
deducing macroscopic properties explicitly from our assumptions about 
the interactions of the particles involved. To our knowledge the cur- 
rent model has not been studied before but stochastic models of similar 
'nature' appear for example in predator-prey models [18] and epidemic 
models, in particular models for competing epidemics [10]. 

Acknovi^ledgements: The authors would like to thank Leonid Hanin 
for helpful comments on a draft of this article. 

2. The model 

2.1. Definition. Let {pk)k>o and (7fc)fc>o be probability distributions 
on the nonnegative integers, and let A > 0. We assume throughout 
that the means J2k>o ^^"^ Tlik>o are finite. We exclude the 
(degenerate) case when pi = 1; in fact the reader may for convenience 
assume that pi = 0, since this only amounts to a time-change. 

The continuous-time Markov chain {X{t),Y{t))t>o, taking values in 
Z^, was informally described in Section 1. To recapitulate the main 
points, each healthy cell is replaced by A; > new healthy cells at rate 
Pk. Being replaced hy k = new cells corresponds to dying. Each 
infected cell is replaced by A; > 1 new infected cells at rate p^. When 
an infected cell dies, which occurs at rate po + X, a random number of 
healthy cells are converted into infected cells. If t is the time of such 
an event, we draw a random variable from the distribution (7jfc)jfc>o 
independently of other events. If F^ < X{t) we simply declare F^ of 
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previously healthy cells to be infected, while if Fj > X{t) we declare all 
previously healthy cells to be infected. To define this process formally, 
we list the different possible jumps in Table 1. 





Transition 


Rate 


(i) 


For A; > 0: 






{x,y) {x + k-l,y) 




(ii) 


For k>l: 






{x,y) {x,y + k - 1) 


yPk 


(iii) 


For k>Q: 




{x, y) ^ {x - {x A k)),y - 1 + {x A k)) 


y{Po + A) Tit 



Table 1. Transition rates for the process 
(X(i),y(i))t>o, valid for x,y>0. 

Note that, for certain combinations of x, y and k, the same transition 
occurs mutlipic times in Tabic 1. The correct interpretation is to add 
the corresponding rates. For example, the transition {0,y) i-^ (0, y — 1) 
occurs at rate y{po + A) = Xlfc>o^(^'o + ><hk- 

To avoid trivial cases, we assume throughout that X(0),Y{0) > 1. 
Biologically it might be most relevant to consider the case when Pk — 
for A; > 3, but none of our results depend on any special assumptions 
about {pk)k>o so we will consider general distributions. 

We now state some immediate properties of the model. If it were 
the case that Y(t) = 0, then healthy cells would evolve as a Markov 
branching process, with intensity 1 and offspring distribution {pk)k>o- 
Similarly, if X{t) = for some t, then (Y(t + s))s>o would behave 
like a Markov branching process with the higher intensity (1 + A) and 
an offspring distribution (p^)jk>o derived from {pk)k>o by placing more 
mass on k = (see (1) below). When both components are positive, as 
transition rate (iii) tells us, then healthy cells may turn into infected 
cells. This scenario hence 'helps' the process {Y{t))t>o and 'hurts' the 
process {X{t))t>o. 

Note that the virus is assumed not to change the offspring distri- 
bution of surviving cells. This, together with the increased mortality 
rate of infected cells, determines the form of the transition rates above, 
see (2). Also note that the random number drawn from the distribution 
(7fc)fc>o is the number of new infections due to a lysis event, rather than 
the number of 'free virions'. In the present work we consider only this 
simplified formulation, leaving more realistic modifications for future 
work. 

2.2. The extinction probability. As explained in Section 1, we view 
the extinction probability of the process {y{t))t>o as an indicator of the 
fitness of the virus: 
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Definition 2.1. Let r] = lim^^oo = 0) = P{Y{t) 0) denote 

the extinction probabihty of the process {Y{t))t>o- 

Thus 'small' rj corresponds to 'high fitness'. Note that is a function 
of the parameters {pk)k>o, (7fe)fe>o, A, X{0) and Y{0). For the reasons 
given in Section 1 wc are mainly interested in how rj depends on A and 
the distribution (7fc)fc>o- 

Remark 2.2. From a purely mathematical point of view the allowed 
range of values of A is A > —po. It is not hard to see that Tj^—po) = 
and ?7(A) — )■ 1 as A — )■ oo. However, viruses being parasites, from 
a biological point of view it seems unlikely that infected cells should 
have longer life-length than healthy cells. Throughout the rest of the 
paper we will therefore assume that A > 0. 

3. Additional results 

Let ^ be a random variable with distribution (pa;)a:>o- As mentioned 
above, if X{t) — for some t, then from that time onwards, the process 
{Y{t + s))s>o is a standard branching process. Its intensity is then l + A 
and its offspring distribution (p^)fe>o is given by: 

(1) Po = (Po + A)/(l + A) and p[ = Pk/(1 + A) for A; > L 

Let ^' be a random variable with distribution {p'k)k>o, ^-nd note that 
for A > 0, p(j = P{^' = 0) > P{C = 0) = Po, whereas 

(2) P{e = A; I 0) = P(e = A; I e 7^ 0) for all A; > 1. 

This choice of {p'k)k>o is the only one, given A, such that the intensity 
at which a cell gives birth to A; > 1 new cells is the same for both 
{X{t))t>o and {Y{t))t>o. 

Let r be a random variable independent of with distribution 
(7fe)fe>o- Write 

(3) ^^^^e+^■^e-o}. 

Then has distribution {qk)k>o, where 

(4) go = , and Qk = ^-^-^^^ for A; > 1. 

Write 

(5) Tx inf{t > : X{t) = 0} 

for the (possibly infinite) time when the healthy population becomes 
extinct. The following summarizes some of the previous discussion: 

Proposition 3.1. 

(1) IfY{t) = for some t > 0, then {X{t + s))s>o is a Markov 
branching process with intensity 1 and offspring distribution 

ipk)k>o; 
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(2) If X{t) = for some t > 0, then {Y{t + s))s>o is a Markov 
branching process with intensity 1 + A and offspring distribution 

ip'k)k>o; 

(3) The process (Y{t))Q<t<Tx ^■^ ^ (stopped) Markov branching pro- 
cess with intensity 1 + A and offspring distribution {qk)k>o- 

For proofs of the following basic facts about branching processes, 
see [3, 6, 7]. Consider an arbitrary Markov branching process (W{t))t>o 
with lifelength intensity a, and offspring distribution {zk)k>o such that 
the mean X]ifc>o is finite. Let Z he a random variable with distri- 
bution {zk)k>o- The number 

l^^a-{EiZ)-l), 

is called the Malthusian parameter of the process {W{t))t>o- Let A — 
{W{t) = for some t > O} be the event of extinction. It is well-known 
that P{A) = 1 if /X < 0. If > then 

lim = n, almost surely on the complement A^. 

t—^oo t 

Write a and /3 for the Malthusian parameters of branching processes 
with respective intensities 1 and 1 -|- A, and offspring distributions 
{Pk)k>o ^-nd {<lk)k>07 in parts (1) and (3) of Proposition 3.1. Thus 
a is the parameter for the uninfected population in the absence of in- 
fected cells, and f3 for the infected population in the presence of a very 
large uninfected population. We have that 

(6) a = EiO-h /3 = (1 + A)(i?(^)-1), 
where ^ and ip are as above. Using (3), we find that 

(7) /3 = a+poE{T) + X{E{r)-l). 

Proposition 3.2. Suppose po > or X > 0. Then P{Tx < oo) — 1 if 
and only if either a < or 70 = 0. 

We do not prove this result in detail here, but note that the suf- 
ficiency of the condition a < is immediate from the properties of 
branching processes described above. If a > 0, the necessity of the 
condition 70 = is immediate, since if 70 > there is positive chance 
that Y{t) = at the time of the first transition, while the healthy 
process survives. Intuitively, the sufficiency of the condition 70 = 
follows from the fact that Y{t) is 'immortal' as long as X{t) 7^ and 
(from (7), since E(r) > 1) grows much faster than (X{t))t>0: meaning 
that there will be very many infection events for large t. It is not dif- 
ficult to make this intuition rigorous, in fact this will be proved in the 
upcoming paper [5]. 

Recall the main result Theorem 1.1. In words, the assumption 70 = 
says that a lysis event always leads to new infections; thus the failure 
rate of infections is zero. The following proposition shows that we 
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cannot remove the condition 70 = from Theorem 1.1, and still come 
to the same conclusion. We will address this further in Section 5. 

Proposition 3.3. 

(1) There exist {pk)k>o, ^ and probability vectors 'y^^'^ ^ 7^^-' so that 

(2) Furthermore, there exist {pk)k>o, (7fc)fc>o (md Ai < A2 so that 
7]{X,) > r]{X2). 




X 

Figure 1. Illustration of Example 3.4. 

Proposition 3.3 is proved in Section 4. Our proof of the second part 
requires taking E{T) < 1. It is natural to guess that the condition 
7o = in Theorem 1.1 can be replaced by the condition -E'(r) > 1: 
infections are successful 'on average'. This is still an open problem, 
but the following simulation supports this guess. 

Example 3.4. Figure 1 shows estimated values for //(A) when po = 
1/4, p2 = 3/4, 7o = 9/20 and 72 = 11/20. Note that E{T) > 1; 
the simulation suggests that rj is increasing in A. The estimates were 
obtained by running, for each value of A, the process 10^ times, for 
2 ■ 10^ transitions each. The estimate for rj is the fraction of runs where 
Y did not die out. (Usually Y was either or very large at the end 
of a run.) For reasons similar to the second part of Proposition 3.3, 
the true value of ri{X) for A > 0.5 is 1. However for A = 0.5 the 
process {Y{Tx + s))s>o is a critical branching process so the time until 
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extinction is large, accounting for the small deviation from 1 in the 
graph. 

3.1. Initial conditions and connection to ODE. Here is a brief 
informal discussion about how rj depends on the starting condition 

(X(0),y(0)) when A and (7fc)fc>o arc fixed. We focus on two start- 
ing conditions of potential interest, namely {X{0),Y(0)) = (n, 1) and 
(X(0), Y{0)) = (n, m) where we think of n and m as large, but n ^ m. 
The first situation is where one cell, surrounded by healthy cells, is in- 
fected. The second situation would correspond to the case where a large 
number of healthy cells are encountered by a large number of infected 
cells. Formal results would be stated asymptotically as m, n — )■ oo. 

The 'take-home message' of Proposition 3.1 is that {Y{t))t>o essen- 
tially behaves like a branching process, with a Malthusian parameter 
that depends on whether X(t) = or X(t) > 0. If X{t) > the 
Malthusian paramter is /3 = a+pQE{r) + X{E{r) — 1) as already stated 
in (7); if X{t) = the Malthusian parameter is (3' = a — X, as is easily 
deduced from the second part of Proposition 3.1. Here a — E{^) — 1 
is as given in (6). Note that /3' < p. 

The most interesting behaviour occurs if a, /3 > 0, which we assume 
henceforth. We also assume that 70 > 0, since the case 70 = is 
easily analyzed using Proposition 3.2 and Theorem 1.1. The main 
qualitative differences in behaviour occur according as (3 > a or /3 < a; 
in the former case there are two interesting subcases, namely f3' < 
and /3' > 0. 

Let us start by considering the initial condition X{Q) = n, Y{Q) = m. 
Since Y{0) — mis large and /3 > 0, most hkely the infected population 
starts growing. Roughly speaking, Y(t) ~ me^*. Suppose first (3 > a. 
The healthy population X{t) will then at most be of order ne"* <^ 
me'^*, and eventually Y{t) will so far exceed X{t) that infections will 
overwhelm the healthy population, so that we get X{to) = for some 
time to- From then on the infected population will have Malthusian 
parameter /?'. If /?' < this means that f] will equal 1, whereas if > 
then Tj will be close to 0, since Y{to) is large. On the other hand, if 
a > /3 then typically X{t) will be so much larger than Y{t) that the 
healthy population does not 'feel' the presence of the infection. Since 
/3 > and ^(0) = m is large it follows that 77 is close to 0. 

Now consider the case {X{0),Y{0)) = (n, 1). This is similar to the 
case (X(0),F(0)) = {n,m), except that there is a considerable chance 
(probability at least 7o(po + A)/(l -|- A), this being the probability of a 
lysis leading to no infections) that the infected process dies out in short 
time. However, if Y(t) does start growing then its size will eventually 
be in the order of e'^*. From then on the same intuition as for the 
starting condition (n, m) is valid. 

Another way to understand the behaviour described above, in par- 
ticular the starting condition {n,m), is to look at the ODE model 
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corresponding to our model. Letting S = {po + \)E(T), this is given by 



It is not hard to see that x{t) eventually reaches the absorbing state 
if ^ > CK, whereas x{t) — )■ +00 if a > ^ and x{0)/y{0) = n/m is 
sufficiently large, corresponding to whether the infection takes over or 
not. We will not study the ODE model further since it cannot give 
information about survival probabilities. 



In this section we prove Theorem 1.1 and Proposition 3.3. In what 
follows we will work with two processes iZi{t))t>o — {Xi{t),Yi{t))t>o, 
with parameters {pk)k>o, K and (7*-*-*)a;>o, « = 1,2, respectively. We 
write Ti = mi{t > : Xi{t) = 0}. 

Theorem 4.1. Let < Ai < A2; 7^^^ ^ 7^^^ and'j^^^ = = 0. Then 
there is a coupling of the processes {Zi{t))t>o = l,2j such that the 
following hold almost surely: 

(1) T2 < Ti, 

(2) Y2{t) > Y^(t) for all t < T2, 

(3) X2{t) < Xi{t) for all t < T2, and 

(4) Xi{t) + Yi{t) > X2{t) + Y2{t) for all t > 0. 

Before we prove Theorem 4.1, we show how Theorem 1.1 follows, 
almost immediately, from Theorem 4.1 and Proposition 3.2. 

Proof of Theorem 1.1. By Proposition 3.2 we have Ti < 00 almost 
surely. For t > Ti it follows from parts (1) and (4) of Theorem 4.1 
that Yi{t) > Y2{t), so that P(Yi{t) ^ 0) < P(Y2{t) ^0). □ 

Proof of Theorem 4-1- The basic strategy is to 'twin' cells in the pro- 
cess {Zi{t))t>o with cells in the process {Z2{t))t>o so that 'events' in 
one process correspond with 'events' in the other process. Intuitively, 
the reason we can acheive a coupling as the one claimed is that only 
'lysis events' occur at a higher rate in the second process: these events 
always increase the infected population, but decrease both the healthy 
and total populations. Here are the details. 

It will be convenient to think of Zi(t), Xi{t), and Yi{t) {i = 1,2) as 
sets of individual cells. Formally, we could label the elements in the set 
Zi{t) by (xiJ),... (x|Xi(t)|, i), {yi, i), • • • , {y\Yi(t)\:i)- However, this nota- 
tion would quickly become cumbersome, and so we will use a somewhat 
less formal, although still rigorous, approach. We will describe the tran- 
sitions of the coupled process {Zi{t), Z2{t))t>o at the level of pairs (a, b) 



x{t) = ax{t) - 5y{t), y{t) = /3y{t), 
as long as x{t) > 0. U a ^ (3 the solution is 




4. Proofs 
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of individual cells, where a e Zi{t) and b G Z2{t). For each t >0, each 
element of Zi (t) U ■Z'2 (t) is required to belong to a unique such pair, and 
we say that a and b are twinned if they belong to the same pair. We 
allow the possibility 6 = 0, in which case we say that a is untwinned 
(the possibility a = will not occur). We will say that a cell is of type 1 
(respectively, type 2) if it belongs to Zi{t) (respectively, Z2{t)). 

It is a standard consequence of the ordering 7^^^ ■< 7*^^) that we may 
couple two random variables F"^^) and F*^^) such that F*^^) has law 7*^^^ 
and F*^^) has law 7*^^) and P{V^^^ < F*^^^) = 1. We assume henceforth 
that {(Fi^j, F2,i)}i>i is an i.i.d. sequence such that Fi^j < F2,i for every 
i, and that Fi^j has law 7^^^ and r2,i has law 7^^^. In the construction 
that follows below, if there is a lysis event at some time r, we let /(r) 
be the smallest i such that (Fi j, F2,i) has not previously been used in 
the construction. 

We start by describing the coupling {Zi{t), Z2{t))t>o up to time Ti A 
T2 (it will transpire that Ti AT2 — T2). It will be convenient to think 
of healthy cells as coloured blue and infected cells as coloured red. The 
allowed colour combinations before time Ti A T2 are the following (the 
right column introduces notation for the number of pairs of each colour 
combination) : 



Colour 


Number 


{B,B) 


b2 


iR,R) 


r 


{B,R) 


bi 


(^,0) 


bo 



Now we turn to describing the transitions. 

(i) Any pair (a, b) is replaced by A; > 1 new pairs at rate pk] all the 
new pairs have the same colour combination as the original pair 
(a, b) (also in the case 6 = 0). 

(ii) A pair of colour {B, B) or [B, 0) is deleted at rate po- 

(iii) A pair (a, h) — {R, R) can give rise to the following additional 
transitions. 

Firstly, at rate po + Ai it has a type-l-lysis. If r is the time of 
such an event, we first delete (a, 6), and then one of the following 
cases occur. 

Case 1: ^2,i{t) ^ ^2- Then we take '^ij{t) pairs of colour (5, B) 
and change their colour to (i?, i?), and we take F2,/(r) — "^ijir) of 
the remaining [B, i?)-pairs and change their colour to [B, R). 

Case 2: Fi^/(^) > 62- Then we start by changing all the 62 pairs 
of colour {B,B) to {R,R). Let F'^^^^^ := Fi,7(^) - ^2 > 0, and 
proceed by changing F'^^ j^^-^ A bi pairs of colour {B, R) to colour 
{R, R). Proceed by letting F'^' j^^-^ := {T[ j^^-^ — 61) VO and changing 
^'1 7(t) ^ ^0 pairs of colour (B, 0) to {R, 0). Note that in this case 
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we arrive at time T2 < Ti, and that this inequahty is strict if 
and only if there remain pairs of colour {B, R) or {B, 0) after the 
changes are made. In particular, we only create any {R, 0)-pairs 
if we arrive at time T2. 

Case 3: T2j{t) > &2 > ri^/(,-). Then we first take ^i,i(t) pairs 
of colour {B, B) and change their colour to [R, R), and then take 
the remaining {h-2 — ^\j[T)) pairs of colour (S, B) and change them 
to {B, R). In this case wc arrive at time T2 < Ti. 

Secondly, at rate A2 — Ai the pair (a, b) has a type-2-lysis. If r 
is the time of such an event: 

the original pair (a, b) stays unaltered; 
one pair of colour {B, B) gets replaced by (5, 0); 
(r2,/(T) — 1) A (62 — 1) of the remaining (62 — 1) pairs 
of colour [B, B) have their colour changed to {B, R). 

Note that we arrive at time T2 < Ti if ^2,i{t) > &2- 
(iv) Finally, a pair (a, b) of colour [B, R) can give rise to the following 
additional transitions. 

Firstly, at rate po a type- 1- death occurs. If r is the time of 
such an event, then (a, b) is deleted and r2,/(r) A 62 pairs of colour 
{B,B) arc changed to {B,R). Note that if r2,7(r) > ^2 then we 
arrive at time T2 < Ti. 

Secondly, at rate A2 a type-2-lysis. This yields the exact same 
transition as described in ( 8). Again, if T'2,i{t) > &2 then we arrive 
at time T2 < Ti. 

It is straightforward to check that the described couphng produces 
the right marginal dynamics. It might be helpful however to explain 
why we replace a pair {B, B) by {B. 0) in (8). Here, the cell that lyses 
is destroyed and one might expect a transition from {R,R) to 0). 
However, since the type 2 cell that lyses infects cells of X2{t), we pick 
one of the newly infected type 2 cells (belonging to a {B, B) pair), and 
twin it with the previous type 1 twin that did not undergo a lysis. 
The effect is the same as replacing a pair (5, B) by 0) and leaving 
(i?, R) unchanged. 

The description above applies until time T2; the construction implies 
that T2 < Ti, proving the first part of the theorem. Since a type 1 cell 
is of colour R only if it is twinned with a type 2 cell of colour R (for 
t < T2), the second part of the theorem also follows. Similarly, a type 
2 cell is coloured B only if it is paired with a type 1 cell of colour B 
(for t < T2), proving the third part of the theorem. 

The construction so far also implies the final part of the theorem for 
the range < t < T2 since every type 2 cell has a twin; the construction 
for t > T2, which we will describe now, will preserve this property. 

For t > T2 the process {Z2{t))t>T2 consists only of infected cells. It 
will be convenient now to think of the cells of type 2 as green. The 
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reason is that in the absence of healthy cells, the process (i2(^))t>r2 
evolves differently compared to when i < T2, as described above and 
in Proposition 3.1. Therefore, we keep the colours blue and red for 
healthy and infected cells of type 1, respectively. At a time t > T2 we 
then have the following possible colour combinations (the right column 
introduces notation for the number of pairs of each colour combination, 
note that we are redefining bg and 61): 



Colour 


Number 




bo 


{B,G) 


hi 


{R,G) 




(i?,0) 


ro 



The following transitions may occur. 

(i) As before, any pair (a, b) is replaced by /c > 2 identical pairs at 
rate pk- 

(ii) A pair of colour {B, 0) or {B, G) is deleted at rate po- 

(iii) A pair of colour {B, G) is, additionally, changed to (i?, 0) at rate 

(iv) A pair (a, b) of colour [R, 0) or [R, G) lyses at rate p^ + Xi. If r is 
the time of such an event, then we delete (a, 6), change the colour 
of V'-^ j^^^ — ri^7(^) A 60 pairs of colour {B, 0) to {R, 0), and finally 
change the colour of (ri^7(^) — r[ j^^-^) A bi pairs of colour {B, G) 
to {R,G). 

(v) Additionally, a pair of colour {R, G) is replaced by a pair (i?, 0) 
at rate A2 — Ai. 

Since a green cell is always twinned with a type 1 cell, this establishes 
the result. 

As before, it is elementary to check that the described coupling pro- 
duces the right marginal dynamics. □ 

Proof of Proposition 3.3. For the first part, consider the models where 
P2 = l,0<A<lis arbitrary, and where "* = 1 and 7^^'' = 1. Clearly 
7(1) < 7(2). Write {X^{t),Yi{t))t>^ and {X^{t),Y^{t))t>o for the pro- 
cesses with parameters (pfc)fc>0! A, 7'-^^ and (pfe)fe>o, A, 7'^^^ respectively. 
Let Xi{Q) = X2(0) and Yi{0) = 1^2(0). There is no interaction between 
{Xi{t))t>o and (^i(t))t>o, and (yi(t))t>o simply forms a supercritical 
branching process (this uses A < 1). 

It is straightforward to couple {Yi{t))t>o with (-'^2(^), ^2(^))t>o so 
that the following hold. Firstly, each transition Yi{t) — ;> Yi{t) -|- 1 is 
accompanied by (X2(t), y2(^)) — ^ (-^2(^)5 ^2 (^) + !)• This simply cor- 
responds to the event that an infected cell is replaced by two identical 
ones; recall that p2 = 1- Secondly, if X2{t) ^ then each transition 
Yi(t) ^ Yi(i) - 1 is accompanied by (X2(i), FaC^)) ^ (^2(t) -l.Y^ii)). 
This corresponds to the death of an infected cell: since 70^"* = 1 such 
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an event simply reduces Yi{t) by 1, and since 7} = 1 one healthy cell 
becomes infected in the second process. Thirdly, if X2{t) — then 
each transition Yi(t) — )■ Yi{t) — 1 is accompanied by {X2{t),Y2{t)) — >■ 
{X2{t),Y2{t) — 1). This corresponds to the death of an infected cell in 
the absence of healthy cells to infect. In such a coupling, Yi{t) < Y2{t) 
for all t > 0, almost surely, so 77(7*^^^) < ?7(7*-^^). It is easy to see that 
the inequality is in fact strict. 

For the second part, we use the result in the forthcoming article [5] 
that the coexistence probability 

C{x)^p{x{t)Y{t)^oyt>o) 

satisfies: C = 0if/3>Q;>0, and C>0ifa>/3>0. Note that r] < 1 
if C > 0. If C(A) = and, in addition, X> a, then it follows from part 
(2) of Proposition 3.1 that 77(A) = 1: either (Y{t))t>o becomes extinct 
before {X{t))t>o, or {X{t))t>o becomes extinct before {Y{t))t>o, and in 
the latter case {Y(t))t>o subsequently forms a branching process which 
has Malthusian parameter /3' = a — X < and therefore becomes 
extinct almost surely. It is easy to check that if po = 3/8, p2 = 5/8, 
^(r) = 4/5, Ai = 7/8 and A2 = 17/8, then /3(Ai) = 3/8 > 2/8 = a 
and Ai > a, whereas f3{X2) = 1/8 < a. Thus C('^i) = and r]{Xi) — 1, 
but C(A2) > so ri{X2) < 1. □ 



5. Discussion 

5.1. Bacteriophage Lambda. The virus Lambda, which preys on 
the bacterium e-coli, has been the subject of intensive research, mainly 
to understand the fascinating lysis-lysogeny behaviour [9, 16]. For this 
virus, a decision between lysis and lysogeny occurs both at the time 
of infecting a new host, as well as after having been incorporated in 
the host's DNA [11]. The switch to lytic behaviour in response to 
stress to the host seems inevitable [16]. In recent years, some exciting 
single-cell studies have investigated the factors determining the decision 
at the time of infection. The results showed strong dependence on 
environmental signals as well as the volume of the infecting cell and 
the number of infecting virions per cell (or multiplicity of infection, [8, 
21, 22]. 

A number of mathematical models have been proposed to study the 
balance between the lytic and lysogenic states [1, 2, 14, 17, 19, 20]. 
With time the models simulated more and more accurately by includ- 
ing newly discovered genetic components, describing a strict bias to- 
wards the lysogenic state as exhibited by 'wet' experiments. Both the 
experimental and theoretical works revealed the molecular mechanism 
of the decision, but did not study the motivation behind such a strict 
bias in the decision system, as we do here. 
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5.2. General conclusion. In this work we liave tried to reveal the 
reason for the observed stability of the lysogenic state. Our results 

suggest that, regardless of the details of the molecular mechanism be- 
hind the decision, the stability of the dormant state is a fundamental 
part of a long-term survival strategy. 

To be specific, in Theorem 1.1 we showed, in the context of our 
model, not only that A = maximizes the survival chance 1 — r){X) of 
the virus, but that ^^(A) is in fact monotonically increasing in A > 0; 
this was done under the assumption 70 = 0. Hcuristically, monotonic- 
ity in A holds for the following reason. As long a.s X{t) 7^ 0, the infected 
process {y{t))t>o behaves as a branching process with a higher expo- 
nential growth rate than {X{t))t>o', this follows from Proposition 3.1, 
equation (7), and the fact that 70 = implies E{r) > 1. Typically, 
therefore, X{t) = for some t, after which point {Y{t+s))s>o by Propo- 
sition 3.1 is a branching process with Malthusian parameter f3' — a — X 
which is decreasing in A. The smaller A is, the larger should be the 
chance that (Y(t + s))s>o survives. 

The intuition above is valid whenever E(r) > 1, supporting the guess 
that the conclusion of Theorem 1.1 should hold whenever E{r) > 1. 
We have only been able to make the intuition rigorous when 70 = 0, 
essentially because then X(t) almost surely (by Proposition 3.2), 
which does not hold if 70 > 0. Interestingly, 1]{X) need not be monotone 
in A if E{r) < 1, as shown in Proposition 3.3; this does not, however, 
rule out the possibility that 77(A) still always attains its minimum at 
A = 0. 

5.3. Decision at the time of infection. The results above concern 
only the decision between lysis and lysogeny after the virus has been 
incorporated in the host's DNA, and not decisions at the time of infec- 
tion. A simple version of a decision at the time of infection can easily 
be incorporated into our model as follows. 

We modify transition rate (iii) in Section 2.1 so that, with a fixed 
probability k > 0, each newly infected cell is immediately replaced 
by a random (independent) number F of infected cells, taken from 
the healthy population X{t). This proceeds recursively for all thereby 
newly infected cells, until there are either no healthy cells left, or the 
recursion terminates by itself. Note that the life-length of an infected 
cell is now a convex combination of an exponential distribution and a 
Dirac mass at 0, but that the process {X{t),Y{t))t>o is still Markovian. 
As mentioned, the recursion terminates at the latest when the healthy 
population X{t) is exhausted. Therefore, there are no transitions of 
'infinite size'. 

The total number of new infections due to the original lysis event 
may be described using a random variable F', whose distribution is 
easily described in terms of F and X{t). The process thus described 
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is not simply the same as our main model with F replaced by F' (one 
easy way to see this is to note that the new process can have transitions 
decreasing X{t) + Y{t) by more than one at a time). However, it is 
possible to modify the proofs of Theorems 4.1 and 1.1 to obtain the 
following: 

Theorem 5.1. In the process with decision at the time of infection 
described above, with 70 = 0, the extinction probability of {Y{t))t>o is 
monotonically increasing in X, {jk)k>o CL^d k. 

Briefly, the required modifications to Theorem 4.1 are the following; 
for notation and terminology, see Section 4. In addition to the param- 
eters Ai < A2 and 7*^^^ ^ 7*^^^ we also have < ki < ^2 < 1. In the 
proof of Theorem 4.1 there were several points where pairs of colour 
{R,R), {B,R), (i?, 0) or {R,G) were created. These transitions are 
still valid, but now, in addition, each newly created {R, R) will itself 
immediately undergo a type-l-lysis with probability Ki, or a type-2- 
lysis with probability K2 — Ki- Similarly, each new {B,R) immediately 
undergoes a type-2-lysis with probability K2, and each new {R, 0) or 
(i?, G) immediately undergoes a lysis with probability ki. The same is 
then done recursively for all thereby newly created pairs {R, R), {B, R), 
(i?, 0) or {R,G). The order in which these 'immediate' transitions are 
carried out is not important. It is easy to see that the conclusions of 
Theorem 4.1 still hold under these modifications. 

Theorem 5.1 says that, when 70 = 0, the optimal 'choice' of A and 
K, for the virus is A = k = 0. Returning to the bacteriophage Lambda, 
which frequently lyses its host cell immediately after infection, we con- 
clude that the model just described is inadequate as a description of 
this virus. The main confounding assumptions are presumably: firstly, 
that 7o = 0; secondly, that there is absolutely no delay between the 
lytic phase and new infections; and thirdly, that factors such as MOI, 
which experiments have shown to be important, are not included. Fur- 
thermore, it is not hard to imagine other factors which could make 
a more rapid increase in numbers beneficial to the virus in the early 
stages of an epidemic, such as competition from other viruses or an 
immune response. It is hoped that relevant modifications of the model 
can be studied in future work. 

5.4. Future directions. The main questions left open by this work 
are: is Theorem 1.1 true whenever E{r) > 1? and, what choices of A 
and (7fe)fe>o minimize rj when E{r) < 1? 

There are many natural ways to modify the model to make it more 
realistic as a model for viruses. One direction would be to instead let 
F be the number of new virions upon a lysis event, and let the number 
of new infections depends also on the ratio of X{t) to Y{t). Another 
direction would be to study the model with decision at the moment of 
infection also when 70 > 0; this requires some new arguments. 
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It is natural to consider the possibility of two competing viruses, 
alternatively a virus competing with an immune system. Finally, it 
would be natural to look at a version of the process which is based not 
on branching process dynamics, but on the dynamics of population 
models having some type of equilibrium like the logistic process [18]. 
Indeed it is reasonable to expect that the cell population will be in 
equilibrium at the time of infection, possibly making such a formulation 
closer to reahty. 
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